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Abstract 

Large  scale  real-world  network  data  such  as  social  and  information  networks  are  ubiquitous.  The 
study  of  such  social  and  information  networks  seeks  to  find  patterns  and  explain  their  emergence  through 
tractable  models.  In  most  networks,  and  especially  in  social  networks,  nodes  have  a  rich  set  of  attributes 
(e.g.,  age,  gender)  associated  with  them. 

Here  we  present  a  model  that  we  refer  to  as  the  Multiplicative  Attribute  Graphs  (MAG),  which 
naturally  captures  the  interactions  between  the  network  structure  and  the  node  attributes.  We  consider  a 
model  where  each  node  has  a  vector  of  categorical  latent  attributes  associated  with  it.  The  probability  of 
an  edge  between  a  pair  of  nodes  then  depends  on  the  product  of  individual  attribute-attribute  affinities. 
The  model  yields  itself  to  mathematical  analysis  and  we  derive  thresholds  for  the  connectivity  and  the 
emergence  of  the  giant  connected  component,  and  show  that  the  model  gives  rise  to  networks  with  a 
constant  diameter.  We  analyze  the  degree  distribution  to  show  that  MAG  model  can  produce  networks 
with  either  log-normal  or  power-law  degree  distributions  depending  on  certain  conditions. 


1  Introduction 

With  the  emergence  of  the  Web,  large  online  social  computing  applications  have  become  ubiquitous,  which 
in  turn  gave  rise  to  a  wide  range  of  massive  real-world  social  and  information  network  data  such  as  social 
networks,  computer  networks,  Internet  networks,  communication  networks,  e-mail  interactions,  Web  graphs, 
and  so  on.  The  unifying  theme  of  studying  real-world  networks  is  to  find  patterns  of  connectivity  and  explain 
them  through  models.  The  main  objective  is  to  answer  questions  such  as  “What  do  real  graphs  look  like?”, 
“How  do  they  evolve  over  time?  “How  can  we  synthesize  realistic  looking  graphs?”,  “How  can  we  find 
models  that  explain  the  observed  patterns?”,  and  “What  are  algorithmic  consequences  of  the  observations 
and  models?”. 

Research  on  empirical  observations  about  the  structure  of  networks  and  the  models  giving  rise  to  such 
structures  go  hand  in  hand.  The  empirical  analysis  of  large  real-world  networks  aims  to  discover  com¬ 
mon  structural  properties  or  patterns,  such  as  heavy-tailed  degree  distributions  ffT5l  HTIl.  local  clustering  of 
edges  rf43l[30l,  small  diameter  |[3]  [28) ,  navigability  f36il22i.  emergence  of  community  structure  lf29l.  and  so 
on. 

In  parallel,  there  have  been  efforts  to  develop  the  network  formation  mechanisms  that  naturally  gen¬ 
erate  networks  with  the  observed  structural  features.  In  these  network  formation  mechanisms,  there  have 
been  two  relatively  dichotomous  modeling  approaches.  Broadly  speaking,  the  theoretical  computer  science 
and  physics  community  have  mainly  focused  on  relatively  simple  “mechanistic”  but  analytically  tractable 

*A  short  version  of  this  paper  appeared  in  Proceedings  of  the  Seventh  Workshop  on  Algorithms  and  Models  for  the  Web  Graph 
( WAW'10 )  Q3. 
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network  models  where  connectivity  patterns  observed  in  the  real-world  naturally  emerge  from  the  model. 
The  prime  example  in  this  line  of  work  is  the  Preferential  Attachment  model  with  its  many  variants  (H  ;T 
mmm,  which  specifies  a  simple  but  very  natural  edge  creation  mechanism  that  in  the  limit  leads  to  net¬ 
works  with  power-law  degree  distributions.  Other  models  of  similar  flavor  include  the  Copying  Model  (23). 
the  Small- world  model  I43ll22l.  Geometric  Random  Graphs  flTTl.  the  Forest  Fire  model  [f28ll.  the  Random 
surfer  model  0,  and  models  of  bipartite  affiliation  networks  l24l.  On  the  other  hand,  in  statistics,  ma¬ 
chine  learning  and  traditional  social  network  analysis,  a  different  approach  to  modeling  network  data  has 
emerged.  There  the  effort  is  in  the  development  of  statistically  sound  models  that  consider  the  structure  of 
the  network  as  well  as  the  features  ( e.g .,  age,  gender)  of  nodes  and  edges  in  the  network.  Examples  of  such 
models  include  the  Exponential  Random  Graphs  (42),  the  Stochastic  Block  Model  (2),  and  the  Latent  Space 
Model  m. 

“Mechanistic”  and  “Statistical”  models.  Generally,  there  has  been  some  gap  between  the  above  two 
lines  of  research.  The  “mechanistic”  models  are  analytically  tractable  in  a  sense  that  one  can  mathemati¬ 
cally  analyze  properties  of  the  networks  that  arise  from  the  models.  These  models  emphasize  the  natural 
emergence  of  networks  that  have  certain  structural  properties  found  in  real-world  networks.  Flowever,  such 
models  are  usually  not  statistically  interesting  in  a  sense  that  they  do  not  nicely  lend  themselves  to  model 
parameter  estimation  and  are  generally  too  simplistic  to  model  heterogeneities  between  individual  nodes. 

On  the  contrary,  “statistical”  models  are  generally  analytically  intractable  and  the  network  properties 
do  not  naturally  emerge  from  the  model  in  general.  Flowever,  these  models  are  usually  accompanied  by 
statistical  procedures  for  model  parameter  estimation  and  very  useful  for  testing  various  hypotheses  about 
the  interaction  of  connectivity  patterns  and  the  properties  of  nodes  and  edges. 

Although  models  of  network  structure  and  formation  are  seldom  both  analytically  tractable  and  statis¬ 
tically  interesting,  an  example  of  a  model  satisfying  both  features  is  the  Kronecker  graphs  model  Il26ll44l. 
which  is  based  on  the  recursive  tensor  product  of  small  graph  adjacency  matrices.  The  Kronecker  graphs 
model  is  analytically  tractable  in  a  sense  that  one  can  analyze  global  structural  properties  of  networks  that 
emerge  from  the  model  Il32l[25ll6l.  In  addition,  this  model  is  statistically  meaningful  because  there  exists 
an  efficient  parameter  estimation  technique  based  on  maximum  likelihood  671120).  It  has  been  empirically 
shown  that  with  only  four  parameters  Kronecker  graphs  quite  accurately  model  the  global  structural  proper¬ 
ties  of  real-world  networks  such  as  degree  distributions,  edge  clustering,  diameter  and  spectral  properties  of 
the  graph  adjacency  matrices. 

Modeling  networks  with  rich  node  attribute  information.  Network  models  investigate  edge  creation 
mechanisms,  but  generally  a  rich  set  of  attributes  is  associated  with  each  node.  This  is  especially  true  in 
social  networks,  where  not  only  people’s  connections  but  also  their  characteristics,  like  age,  gender,  work 
place,  habits,  etc.,  have  been  collected.  Similarly,  various  types  of  profile  information  is  provided  by  users 
in  online  social  networks.  In  this  sense,  both  node  characteristics  and  the  network  structure  need  to  be 
considered  simultaneously. 

The  attempt  to  model  the  interaction  between  the  network  structure  and  node  attributes  raises  a  wide 
range  of  questions.  For  instance,  how  do  we  account  for  the  heterogeneity  in  the  population  of  the  nodes  or 
how  do  we  combine  node  features  in  an  interesting  way  to  obtain  probabilities  of  individual  links?  While 
the  earlier  work  on  a  general  class  of  latent  space  models  formulated  such  questions,  most  resulting  models 
were  either  analytically  tractable  but  statistically  uninteresting  or  statistically  very  powerful  but  do  not  lend 
themselves  to  mathematical  analysis. 

To  bridge  this  gap,  we  propose  a  class  of  stochastic  network  models  that  we  refer  to  as  Multiplicative 
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Attribute  Graphs  (MAG).  The  model  naturally  captures  the  interactions  between  the  network  structure  and 
the  node  attributes  in  a  clean  and  tractable  manner.  We  consider  a  model  where  each  node  has  a  vector 
of  categorical  attributes  associated  with  it.  Individual  attributes  of  nodes  are  then  combined  in  order  to 
model  the  emergence  of  links.  The  model  allows  for  rich  interaction  between  node  features  in  a  sense  that 
one  can  simultaneously  model  features  that  reflect  homophily  (i.e.,  love  of  the  same)  as  well  as  heterophily 
(i.e. ,  love  of  the  different).  For  example,  if  people  share  certain  features  like  hobby,  they  are  more  likely  to  be 
friends.  However,  for  some  other  features  like  gender,  people  may  be  more  likely  to  form  a  relationship  with 
someone  with  the  opposite  characteristic.  The  proposed  MAG  model  is  designed  to  capture  both  homophily 
and  heterophily  that  naturally  occur  in  social  networks. 

We  proceed  by  formulating  the  model  and  show  that  it  is  both  analytically  tractable  and  statistically 
interesting.  In  the  following  sections,  we  present  our  mathematical  results.  Section  [3]examines  the  number 
of  edges  and  shows  that  our  model  naturally  obeys  the  Densification  Power  Law  lf28l.  Section [4] examines 
the  connectivity  of  MAG  model,  which  includes  the  conditions  not  only  when  the  network  contains  a  giant 
connected  component  but  also  when  it  becomes  connected.  Section  [5]  shows  that  the  diameter  of  the  MAG 
model  remains  small  even  though  the  number  of  nodes  is  large.  Section  [6]  shows  that  networks  emerg¬ 
ing  from  the  MAG  model  have  a  log-normal  degree  distribution.  Furthermore,  Section  [7]  describes  a  more 
general  version  of  the  model  that  can  also  capture  the  power-law  degree  distribution.  We  view  this  as  partic¬ 
ularly  interesting  in  the  light  of  a  long-standing  debate  about  how  to  distinguish  the  power-law  distribution 
from  the  log-normal  distribution  in  empirical  data  ll37l[38ll  and  what  implications  this  would  make  for  real- 
world  networks.  Also,  our  results  imply  that  the  MAG  model  model  is  flexible  in  a  sense  that  networks  with 
very  different  properties  emerge  depending  on  the  parameter  configuration.  Finally,  Section  [8]  verifies  the 
properties  of  the  MAG  model  by  simulation  experiments.  The  results  of  the  simulations  examine  how  the 
synthetic  network  changes  depending  the  parameters  as  well  as  how  similar  the  network  looks  to  real-world 
networks. 

2  Formulating  of  the  Multiplicative  Attribute  Graph  (MAG)  model 

In  this  section,  we  begin  with  the  introduction  of  the  Multiplicative  Attribute  Graph  (MAG)  model.  Then,  we 
formulate  the  general  version  of  MAG  model  and  present  the  simplified  version  that  we  analyze  throughout 
this  paper.  Finally,  we  investigate  the  connection  to  some  related  works. 

2.1  General  considerations 

We  consider  a  setting  where  each  node  u  has  a  vector  a(u )  of  l  categorical  (e.g.,  binary)  attributes  associated 
with  it.  For  simple  examples,  one  can  think  of  such  attribute  vectors  as  a  sequence  of  answers  to  l  yes/no 
questions  such  as  “Are  you  female?”,  “Do  you  like  ice  cream?”,  and  so  on. 

The  other  essential  ingredient  of  our  model  is  to  specify  a  mechanism  that  generates  the  probability  of 
an  edge  between  two  nodes  based  on  their  attribute  vectors.  As  mentioned  before,  we  aim  to  be  able  to 
account  for  the  homophily  of  certain  features  as  well  as  the  heterophily  of  the  others  by  our  model.  For  this 
mechanism,  we  associate  each  attribute  i  (i.e.,  i-th  question)  with  an  attribute-attribute  affinity  matrix  0,. 
Each  entry  of  matrix  0j  represents  the  affinity  depending  on  the  values  of  the  /-th  attribute  between  a  pair  of 
nodes.  More  precisely,  ®i[z\,  Z2]  indicates  the  affinity  between  a  pair  of  nodes,  each  of  which  respectively 
takes  value  z\  and  Z2  for  its  i- th  attribute.  For  the  binary  attribute  example  in  Figure  [Q  each  0,  is  a  2  x  2 
matrix.  To  obtain  the  affinity  corresponding  to  the  i- th  attribute  between  node  u  and  v,  the  values  of  z-th 
attribute  of  both  nodes  select  an  appropriate  cell  of  0,;. 
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P(u,v)  =  a,  •  p2  •  y3  ■  a4 


Figure  1:  Schematic  representation  of  the  Multiplicative  Attribute  Graphs  (MAG)  model.  Given  a  pair 
of  nodes  u  and  v  with  the  corresponding  binary  attribute  vectors  a(u)  and  a(v),  the  probability  of  edge 
P[u,v]  is  the  product  over  the  entries  of  attribute-attribute  affinity  matrices  0,  where  values  of  at (u)  and 
a,i(v)  “select”  the  appropriate  entries  (row/column)  of  0j.  Note  that  this  visualized  model  represents  the 
undirected  graph  by  make  each  0,  symmetric.  However,  the  MAG  model  in  general  represents  the  directed 
graph. 
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Figure  2:  Structures  in  which  a  node  attribute  can  affect  link  affinity.  The  widths  of  arrows  correspond  to 
the  affinities  towards  link  formation. 


By  these  affinity  matrices,  we  can  capture  the  various  types  of  structure  in  real-world  social  networks. 
For  example,  Figure  [2]  shows  four  possible  linking  affinities  of  a  binary  attribute.  Top  figure  of  each  case 
visualizes  the  general  structure  of  networks.  Each  circle  represents  the  group  which  shares  the  attribute 
value  and  the  width  of  each  arrow  indicates  the  affinity  of  the  link  formation  in  the  given  direction  ( e.g .,  the 
arrow  0  — >•  1  indicates  the  affinity  of  link  formation  between  a  node  with  ”0”-value  of  a  given  attribute  and 
a  node  with  ”l”-value  of  that  attribute.).  Then,  under  each  figure,  we  represent  the  structure  in  the  form  of 
the  affinity  matrix. 

To  investigate  one  by  one.  Figure [2 a)  shows  the  homophily  (love  of  the  same)  attribute  affinity  and  the 
corresponding  affinity  matrix  0.  Notice  large  values  on  the  diagonal  entries  of  0,  which  means  that  link 
probability  is  high  when  nodes  share  the  same  attribute  value.  Top  of  the  figure  demonstrates  that  there 
will  be  many  links  between  nodes  that  have  the  value  of  the  attribute  set  to  “0”  and  many  links  between 
nodes  that  have  the  value  “1”,  but  there  will  be  few  links  between  nodes  where  one  has  value  “0”  and  the 
other  “1”.  Similarly,  Figure  [2jb)  shows  the  heterophily  (love  of  the  different)  affinity,  where  nodes  that 
do  not  share  the  value  of  the  attribute  are  more  likely  to  link,  which  gives  rise  to  near-bipartite  networks. 
Also,  Figure  0c)  shows  the  core-periphery  affinity,  where  links  are  most  likely  to  form  between  “0”  nodes 
(i.e.,  members  of  the  core)  and  least  likely  to  form  between  “1”  nodes  (i.e.,  members  of  the  periphery). 
Notice  that  links  between  the  core  and  the  periphery  are  more  likely  than  the  links  between  the  nodes  of  the 
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periphery.  Additionally,  Figure  |2d)  illustrates  the  uniformly  random  structure  that  the  Erdos-Renyi  random 
graph  model  generates.  By  assiging  the  same  value  into  every  entry  in  each  affinity  matrix,  we  can  build  the 
MAG  model  equivalent  to  the  Erdos-Renyi  random  graphmodel. 

From  these  examples,  we  notice  that  the  MAG  model  nicely  provides  the  flexibility  in  the  network 
structure  via  the  affinity  matrices.  Although  we  presented  the  binary  and  undirected  examples,  the  MAG 
model  basically  allows  more  complicated  structure  with  larger  cardinalities  (e.g.,  3  x  3  or  4  x  4)  as  well  as 
asymmetric  structure  through  asymmetric  affinity  matrices. 

2.2  The  Multiplicative  Attributes  Graph  (MAG)  model 

Now  we  formulate  a  general  version  of  the  MAG  model.  To  start  with,  let  each  node  u  have  a  vector  of  l 
categorical  attributes  and  let  each  attribute  have  cardinality  di  for  i  =  1, 2,  ■  •  ■  l.  We  also  have  l  matrices, 
0,  €  di  x  di  for  i  =  1,2  ,•••/.  Each  entry  of  0,  is  the  affinity  of  a  real  value  between  0  and  1 Q.  Then,  the 
probability  of  an  edge  (u,  v ),  P[u.  v ],  is  defined  as  the  multiplication  of  affinities  corresponding  to  individual 
attributes,  i.  e. , 

i 

P[u,v\  =  [ai(rt),aj(n)]  (1) 

i=l 

where  a,i{u)  denotes  the  value  of  i-tli  attribute  of  node  u.  Note  that  edges  appear  independently  with  proba¬ 
bility  determined  by  node  attributes  and  matrices  0(.  Figure  [T| illustrates  the  model. 

One  can  think  of  the  MAG  model  in  the  following  sense.  In  order  to  construct  a  social  network,  we  ask 
each  node  u  a  series  of  multiple-choice  questions  and  the  attribute  vector  a(u )  stores  the  answers  fo  these 
questions.  The  answers  of  nodes  u  and  v  on  a  question  i  select  an  entry  of  matrix  0,,  i. e. ,  u’s  answer  selects 
a  row  and  v’s  answer  selects  a  column.  One  can  thus  think  of  matrices  0,  ’s  as  the  attribute-attribute  affinity 
matrices.  Assuming  that  the  questions  are  appropriately  chosen  so  that  answers  are  independent  of  each 
other,  the  product  over  the  entries  of  matrices  0,  can  be  regarded  as  the  probability  of  the  edge  between  u 
and  v. 

The  choice  of  multiplicatively  combining  entries  of  0,  is  very  natural.  In  particular,  the  social  network 
literature  defines  a  concept  of  Blau-spaces  041135)1  where  socio-demographic  attributes  act  as  dimensions. 
Organizing  force  in  Blau  space  is  homophily  as  it  has  been  argued  that  the  flow  of  information  between  a 
pair  of  nodes  decreases  with  the  “distance”  in  the  corresponding  Blau  space.  In  this  way,  small  pockets  of 
nodes  appear  and  lead  to  the  development  of  social  niches  for  human  activity  and  social  organization.  In 
this  respect,  multiplication  is  a  natural  way  to  combine  node  attribute  data  (L  e. ,  the  dimensions  of  the  Blau 
space)  so  that  even  a  single  attribute  can  have  profound  impact  on  the  linking  structure  (/.  e.,  it  creates  a 
narrow  social  niche  community). 

The  proposed  MAG  model  model  is  analytically  tractable  in  a  sense  that  we  can  formally  analyze  the 
properties  of  the  model.  Moreover,  the  MAG  model  is  also  statistically  interesting  as  it  can  account  for  the 
heterogeneities  in  the  node  population  and  can  be  used  to  study  the  interaction  between  properties  of  nodes 
and  their  linking  behavior.  Moreover,  one  can  pose  many  interesting  statistical  inference  questions:  Given 
attribute  vectors  of  all  nodes  and  the  network  structure,  how  can  we  estimate  the  values  of  matrices  0,? 
How  can  we  infer  the  attributes  of  unobserved  nodes?  Or,  given  a  network,  how  can  we  estimate  both  the 
node  attributes  and  the  matrices  0,?  However,  the  focus  of  this  paper  is  in  mathematical  analysis  and  we 
leave  the  questions  of  MAG  model  parameter  estimation  for  the  future  work. 

'Note  that  there  is  no  condition  for  0;  to  be  stochastic,  we  only  require  each  entry  of  ©i  to  be  on  interval  (0, 1). 
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2.3  Simplified  version  of  the  model 


Next  we  delineate  a  simplified  version  of  the  model  that  we  will  mathematically  analyze  in  the  further 
sections  of  the  paper.  First,  while  the  general  MAG  model  applies  to  directed  networks,  we  consider  the 
undirected  version  of  the  model  by  requiring  each  0,  to  be  symmetric.  Second,  we  assume  binary  node 
attributes  and  thus  affinity  matrices  0,  have  2  rows  and  2  columns.  Third,  to  further  reduce  the  number 
of  parameters,  we  also  assume  that  the  affinity  matrices  for  all  attributes  are  the  same,  i.e.,  0,;  =  0  for 

a  f3 


all  i.  These  three  conditions  imply  that  0  = 


P  7 


i.e.,  0[O,O]  =  a,  0[0, 1]  =  ©[1,0]  =  P,  and 


©[1,1]  =  7  for  0  <  a,  (3, 7  <  1.  Furthermore,  all  our  results  will  hold  for  a  >  ft  >  7.  The  assumption 
a  >  /3  >  7  is  natural  since  most  large  real-world  networks  have  a  common  onion-like  “core -periphery” 
structure  l29l  [30,  251.  Figure  0c)  exhibits  this  structure.  More  precisely,  the  network  is  composed  from 
denser  and  denser  layers  of  edges  as  one  moves  towards  the  core  of  the  network.  Basically,  a  >  f3  >  7 
means  that  more  edges  are  likely  to  appear  between  nodes  which  share  value  0  on  more  attributes  and 
these  nodes  form  the  core  of  the  network.  Since  more  edges  appear  between  pairs  of  nodes  with  attribute 
combination  “0-1”  than  between  those  with  “1-1”,  there  are  more  edges  between  the  core  and  the  periphery 
nodes  (edges  “0-1”)  than  between  the  nodes  of  the  periphery  themselves  (edges  “1-1”). 

Last,  we  also  assume  a  simple  generative  model  of  node  attributes  where  each  binary  attribute  vector 
is  generated  by  l  independently  and  identically  distributed  coin  flips  with  bias  //.  That  is,  we  use  an  i.i.d. 
Bernoulli  distribution  parameterized  by  //  to  model  attribute  vectors  where  the  probability  that  the  i-th 
attribute  of  a  node  u  takes  value  0  is  P  (a,;  (it)  =  0)  =  //  for  i  =  1,  •  •  •  ,  l  and  0  <  /t  <  1. 

Putting  it  all  together,  the  MAG  model  M(n,l,  //,  0)  is  fully  specified  by  six  parameters:  n  is  the  number 
of  nodes,  /  is  the  number  of  attributes  of  each  node,  /t  is  the  probability  that  an  attribute  takes  a  value  of  1, 
and  0  =  [a  /3;  (3  7]  specifies  the  attribute-attribute  affinity  matrix. 

We  now  study  the  properties  of  the  random  graphs  that  result  from  the  M(n,  l.  p.  0)  where  every  un¬ 
ordered  pair  of  nodes  ( u ,  v)  is  independently  connected  with  probability  P\u,  v]  defined  in  Equation  (Q3  ■ 
Since  the  probability  of  an  edge  exponentially  decreases  in  l,  the  most  interesting  case  occurs  when  l  = 
p  log  n  for  some  constant  pH  This  result  perfectly  agrees  that  the  effective  number  of  dimensions  which  can 
represent  online  social  networks  is  the  order  of  log  n  0. 


2.4  Connections  to  other  models 

We  note  that  our  model  belongs  to  a  general  class  of  latent  space  network  models,  where  nodes  have  some 
discrete  or  continuous  valued  attributes  and  the  probability  of  linking  depends  on  the  values  of  attribute 
of  the  two  nodes.  For  example,  the  Latent  Space  Model  lfl8l  assumes  that  nodes  reside  in  d-dimensional 
Euclidean  space  and  the  probability  of  an  edge  depends  on  the  Euclidean  distance  between  the  locations 
of  the  nodes.  Similarly,  in  Random  Dot  Product  Graphs  l45l.  the  linking  probability  depends  on  the  inner 
product  between  the  vectors  associated  with  node  positions.  Furthermore,  recently  introduced  Multifractal 
Network  Generator  l39l  can  also  be  viewed  as  a  special  case  of  MAG  model  where  the  node  attribute 
distribution  and  the  affinity  matrix  are  equal  for  every  attribute. 

The  MAG  model  generalizes  the  Kronecker  graphs  model  ll25l  in  a  subtle  way.  The  Kronecker  graphs 
model  takes  a  small  (usually  2x2)  initiator  matrix  K  and  tensor-powers  it  l  times  to  obtain  a  matrix  G  of 
size  2l  x  2l,  interpreted  as  the  stochastic  graph  adjacency  matrix.  One  can  think  of  a  Kronecker  graph  model 
as  a  special  case  of  the  MAG  model. 

throughout  the  paper,  log(-)  indicates  log2(-)  unless  explicitly  specified  as  ln(-). 
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Proposition  2.1  A  Kronecker  graph  G  on  2l  nodes  with  a  2x2  initiator  matrix  K  is  equivalent  to  the 
following  MAG  graph  M:  Let  us  number  the  nodes  of  M  as  0,  •  •  •  ,  2;  —  1.  Let  the  binary  attribute  vector  of 
a  node  u  of  M  be  a  binary  representation  of  its  node  id,  and  let  0,  =  K.  Then  individual  edge  probabilities 
(u,  v )  of  nodes  in  G  match  those  in  M,  i.e.,  Pq[u,  n]  =  Pm  [u,  v}. 

The  above  observation  is  interesting  for  several  reasons.  First,  all  results  obtained  for  Kronecker  graphs 
naturally  apply  to  a  subclass  of  MAG  graphs  where  the  node’s  attribute  values  are  the  binary  representation 
of  its  id.  This  means  that  in  a  Kronecker  graph  version  of  the  MAG  model  each  node  has  a  unique  com¬ 
bination  of  attribute  values  (i.e.,  each  node  has  different  node  id)  and  all  attribute  value  combinations  are 
occupied  (i.e.,  node  ids  range  0, . . . ,  2l  —  1). 

Second,  building  on  this  correspondence  between  Kronecker  and  MAG  graphs,  we  also  note  that  the 
estimates  of  the  Kronecker  initiator  matrix  K  nicely  transfer  to  matrix  0  of  MAG  model.  For  example, 
Kronecker  initiator  matrix  K  =  [a  =  0.98, /3  =  0.58,7  =  0.05]  accurately  models  the  graph  of  the 
internet  connectivity,  while  the  global  network  structure  of  the  Epinions  online  social  network  is  captured 
by  K  =  [a  =  0.99,  /3  =  0.53,7  =  0.13]  l27l.  Thus,  in  the  rest  of  the  paper,  we  will  consider  the  above 
values  of  K  as  the  typical  values  that  the  matrix  0  would  normally  take.  In  this  respect,  the  assumption  of 
a  >  /3  >  7  naturally  appears. 

In  following  sections,  we  analyze  the  properties  of  the  MAG  model.  We  focus  mostly  on  the  simplified 
version.  Each  section  states  the  main  theorem  and  gives  the  overview  of  the  proof.  We  omit  the  full  proofs 
in  the  main  body  of  the  paper  and  describe  them  in  the  Appendix. 

3  The  Number  of  Edges 

In  this  section,  we  derive  the  expression  for  the  expected  number  of  edges  in  MAG  model.  Moreover,  this 
formula  can  valdiate  not  only  the  assumption,  l  =  p  log  n,  but  also  a  substantial  social  network  property, 
namely  the  Densification  Power  Law. 

Theorem  3.1  For  a  MAG  graph  M(n,l,  p .  0),  the  number  of  edges,  m,  satisfies 

E  [m\  =  —  —  (p2a  +  2p(l  —  p)/3  +  (1  —  p)2 y)1  +  n  (pa  +  (1  —  p)y)1  . 

The  expression  is  divided  into  two  diffrent  terms.  The  first  term  indicates  the  number  of  edges  between 
distinct  nodes,  whereas  the  second  term  means  the  number  of  self-edges.  If  we  exclude  self-edges,  the 
number  of  edges  would  be  therefore  reduced  to  the  first  term. 

Before  the  actual  analysis,  we  define  some  useful  notations  that  will  be  used  throughout  this  paper. 
First,  let  V  be  the  set  of  nodes  in  the  MAG  graph  M(n,l ,  p,  0).  We  refer  to  the  weight  of  a  node  u  as  the 
number  of  0’s  in  its  attribute  vectors,  and  denote  it  as  |u|  ,  i.e.,\u\  =  Yl\= i  1  {ai(u)  =  0}  where  1  {•}  is  an 
indicator  function.  We  additionally  define  Wj  as  a  set  which  consists  of  all  nodes  with  the  same  weight  j, 
i.e.,  Wj  =  {u  €  V  :  |ti|  =  j}  for  j  =0,1,---  .  1.  Similarly,  Sj  denotes  the  set  of  nodes  with  weight  which 
is  greater  than  or  equal  to  j,  i.e.,  Sj  =  {u  £  V  :  juj  >  j}.  By  definition,  Sj  =  U *  -W*. 

To  complete  the  proof  of  Theorem  13. 11  using  the  definition  of  the  simplified  MAG  model,  we  can  derive 
the  main  lemmas  as  follows: 

Lemma  3.2  For  distinct  u,  v  €  V,  E  [P  [u,  v\  \u  €  W,]  =  (pa  +  (1  —  p)/3)1  (p/3  +  (1  —  p) y)l~l  ■ 

Lemma  3.3  For  u  G  V,  E  [deg(u)\u  €  Wf\  =  (n  —  1)  (pa  +  (1  —  p)/3)1  (p/3  +  (1  —  p)y)l~l  +  2alyl~l  . 
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By  using  these  lemmas,  the  outline  of  the  proof  for  Theorem  13.  II  is  as  follows.  Since  the  number  of 
edges  is  half  of  the  degree  sum,  all  we  need  to  do  is  to  sum  E  [deg(u)]  over  the  degree  distribution.  However, 
because  E  [deg{u)]  =  E  [deg{v)\  if  the  weights  of  u  and  v  are  the  same,  we  can  add  up  E  [deg{u)\u  £  W,] 
over  the  weight  distribution,  /.<?.,  binomial  distribution  Bin  {l,  /.i). 

On  the  other  hand,  more  significantly,  Theoreml3.1lcan  result  in  two  substantial  features  of  MAG  model. 
First,  the  assumption  that  l  =  p  log  n  for  a  constant  p  can  be  validated  by  the  next  two  corollaries. 

Corollary  3.3.1  m  €  o(n)  with  high  probability  as  n  ->•  oo,  if  ^  >  -iog(^a+2At(i-At)^+(i-At)^7)  • 
Corollary  3.3.2  m  €  0(n2_oW)  with  high  probability  as  n  —y  oc,  if  l  <G  o(logn). 

Note  that  log  ( p2a  +  2p{l  —  p)/3  +  (1  —  p)2 7)  <  0  because  both  p  and  7  are  less  than  1.  Thus,  in 
order  for  M{n,l,  p,&)  to  have  a  proper  number  of  edges  (e.g.,  more  than  n),  l  should  be  bounded  by 
the  order  of  log  n.  On  the  contrary,  since  most  social  networks  are  sparse,  l  €  o(log  n)  case  can  be  also 
reasonably  excluded.  In  consequence,  both  Corollarvl3.3.1land  Corollary l3.3.2|provide  the  upper  and  lower 
bounds  of  l  for  social  networks.  These  bounds  eventually  support  the  assumption  of  l  =  p  log  n. 

Although  we  do  not  technically  define  any  process  of  MAG  graph  evolution,  we  can  interpret  it  in 
the  folllowing  way.  When  a  new  node  joins  the  network,  its  behavior  is  governed  by  the  node  attribute 
distribution  which  is  seemingly  independent  of  the  network  structure.  However,  in  a  long  term,  since  the 
number  of  attributes  grows  slowly  as  the  number  of  nodes  increases,  the  node  attributes  and  the  network 
structure  are  not  independent.  This  phenomenon  is  somewhat  aligned  with  the  real  world.  When  a  new 
person  enters  the  network,  he  or  she  seems  to  act  independently  of  other  people,  but  people  eventually 
constitue  a  structured  network  in  the  large  scale  and  their  behaviors  can  be  categorized  into  more  classes  as 
the  network  evolves. 

Second,  under  this  assumption,  the  expected  number  of  edges  can  be  approximately  restated  as 

In2+Plog(M2a+2A‘(1-^)/3+(1-/i)27) 

2 

We  find  that  this  fact  agrees  with  the  Densification  Power  Law  ll28ll.  one  of  the  properties  of  social  networks, 
which  indicates  m(t)  oc  n{t)a  for  a  >  1.  For  example,  an  instance  of  MAG  model  with  p  =  l,p  =  0.5 
(Proposition  12.11),  would  have  the  densification  exponent  a  =  log(|0|)  where  |0|  denotes  the  sum  of  all 
entries  in  0. 

The  proofs  are  fully  described  in  Appendix. 


4  Connectivity 


In  the  previous  section,  we  observed  that  MAG  model  obeys  the  Densification  Power  Law.  In  this  section, 
we  mathematically  investigate  MAG  model  for  another  general  property  of  social  networks,  the  existence 
of  a  giant  connected  component.  Furthermore,  we  also  examine  the  situation  where  this  giant  component 
covers  the  entire  network,  i.e.,  the  network  is  connected. 

We  begin  with  the  theorems  that  MAG  graph  has  a  giant  component  and  further  becomes  connected. 


Theorem  4.1  (Giant  Component)  Only  one  connected  component  of  size  @(n)  exists  in  M  (n,  l.  p,  0)  with 
high  probability  as  n  ^  00  ,  if  and  only  if 


{pa  +  (1  -  p)PY  {p/3  +  (1  -  ^b)1  ^ 


’It  indicates  the  probability  1  —  o(l). 


Theorem  4.2  (Connectedness)  Let  the  connectedness  criterion  function  of  M  (n.  1.  p,  0)  he 


FC(M ) 


(/i/3  +  (1  -  /i)7 Y 

(/ia  +  (1  -  n)p)v  (p/3  +  (1  -  p)'y)1~u 


when  (1  —  p)p  >  \ 
otherwise 


p 

=  \  in  (0,  if). 

Then,  M (n,  l,  it,  0)  is  connected  with  high  probability  as  n  — >•  oo,  if  FC  (M)  >  In  contrast,  M (n,  l,  /i,  0)  is 
disconnected  with  high  probability  as  n  —y  oc,  if  FC(M)  <  i. 


where  v  is  a  solution  of 


(sre 


i-i/ 


To  show  the  above  theorems,  we  first  define  the  monotonicity  property  of  MAG  model. 

Theorem  4.3  (Monotonicity)  For  u,v  £  V,  P  [u,  n||?i|  =  i]  <  P  [u,  n||n|  =  j]  if  i  <  j. 

Theorem  14.31  ultimately  demonstrates  that  a  node  of  larger  weight  is  more  likely  to  be  connected  with 
other  nodes.  In  other  words,  a  node  of  large  weight  plays  a  ’’core”  role  in  the  network,  whereas  the  node  of 
small  weight  is  regarded  as  ’’periphery”.  This  feature  of  the  MAG  model  has  direct  effects  on  the  connect¬ 
edness  as  well  as  on  the  existence  of  a  giant  component. 

By  the  monotonicty  property,  the  minimum  degree  is  likely  to  be  the  degree  of  the  minimum  weight 
node.  Therefore,  the  disconnectedness  could  be  proved  by  showing  that  the  expected  degree  of  the  minimum 
weight  node  is  too  small  to  be  connected  with  any  other  node.  Conversely,  if  this  lowest  degree  is  large 
enough,  say  fl(log  n),  then  any  subset  of  nodes  would  be  connected  with  the  other  part  of  the  graph.  Thus, 
to  show  the  connectedness,  the  degree  of  the  minimum  weight  node  should  be  necessarily  inspected,  using 
Lem  m  a  [331 

Note  that  the  criterion  in  Theoreml4.2lis  separated  into  two  cases  depending  on  p,  which  tells  whether  or 
not  the  expected  number  of  weight  0  nodes,  E  [| ICo|],  is  greater  than  1,  because  \Wj\  is  a  binomial  random 
variable.  If  this  expectation  is  larger  than  1,  then  the  minimum  weight  is  likely  to  be  close  to  0,  i. e. ,  0(1). 
Otherwise,  if  E  [|Wo|]  <  1,  the  equation  of  v  describes  the  ratio  of  the  minimum  weight  to  /  as  n  oc. 
Therefore,  the  condition  for  connectedness  actually  depends  on  the  minimum  weight  node.  In  fact,  the  proof 
of  Theorem  14. 2lis  accomplished  by  computing  the  expected  degree  of  this  minimum  weight  node  and  using 
some  techniques  introduced  in  lf32l. 

Similar  explanation  works  for  the  existence  of  a  giant  component.  Instead  of  the  minimum  weight  node, 
Theorem  14.11  shows  that  the  existence  of  0(n)  component  relies  on  the  degree  of  the  median  weight  node. 
We  intuitively  understand  this  in  the  following  way.  We  might  throw  away  the  lower  half  of  nodes  by 
degree.  If  the  degree  of  the  median  weight  node  is  large  enough,  then  the  half  of  the  network  is  likely  to 
be  connected.  The  connectedness  of  this  half  network  implies  the  existence  of  0  (n)  component,  the  size  of 
which  is  at  least  l|.  In  the  proof,  we  actually  examine  the  degrees  of  nodes  of  three  different  weights:  pi, 
pi  +  l1  F,  and  pi  +  l 2/3.  The  existence  of  0(n)  component  is  determined  by  the  degrees  of  these  nodes. 

However,  the  existence  of  0(n)  component  does  not  necessarily  indicate  that  it  is  a  unique  giant  com¬ 
ponent,  since  there  might  be  another  0(n)  component.  Therefore,  to  prove  Theorem  14.11  more  strictly,  the 
uniqueness  of  0(n)  component  has  to  follow  the  existence  of  it.  We  can  prove  the  uniqueness  by  show¬ 
ing  that  if  there  are  two  connected  subgraphs  of  size  0(n)  then  they  are  connected  each  other  with  high 
probability. 

The  proofs  of  those  three  theorems  are  in  Appendix. 
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5  Diameter 


Another  property  of  social  networks  is  that  the  diameter  of  the  network  remains  small  although  the  number 
of  nodes  grows  large.  We  can  show  this  property  in  MAG  model  by  applying  the  similar  idea  as  in  ||32il. 

Theorem  5.1  If  (pj 3  +  (1  —  p)y)p  >  then  M(n,  l ,  p,  0)  has  a  constant  diameter  with  high  probabil¬ 
ity  as  n  — >•  oo. 

This  theorem  does  not  specify  the  exact  diameter,  but,  under  the  given  condition,  it  guarantees  the 
bounded  diameter  even  though  n  — >•  oo  by  using  the  following  lemmas: 

Lemma  5.2  If  (p/3  +  (1  —  p)y)p  >  for  A  =  S\i  h as  a  constant  diameter  with  high  proba¬ 

bility  as  n  — >•  oo. 

Lemma  5.3  If  (p/3  +  (1  —  p) y)p  >  for  A  =  ,  all  nodes  in  V\S\i  are  directly  connected  to 

S\i  with  high  probability  as  n  — >•  oo. 

By  Lemma  1531  we  can  conclude  that  the  diameter  of  the  entire  graph  is  limited  to  (2+  diameter  of  S\i). 
Since  by  Lemma  15.21  the  diameter  of  S\i  is  constant  with  high  probability  under  the  given  condition,  the 
actual  diameter  is  also  constant. 

The  proofs  are  represented  in  Appendix. 


6  Degree  Distribution 


In  this  section,  we  analyze  the  degree  distribution  of  the  simplified  MAG  model  under  some  reasonable 
assumptions^  Depending  on  0,  MAG  model  produces  graphs  of  various  degree  distributions.  For  instance, 
since  the  network  becomes  a  sparse  Erdos-Renyi  random  graph  if  a  «  j3  «  7  <  1,  the  degree  distribution 
will  approximately  follow  the  binomial  distribution.  For  another  extreme  example,  in  case  of  a  ~  1  and 
p  «  1,  the  network  will  be  close  to  a  complete  graph,  which  represents  a  degree  distribution  different  from 
a  sparse  Erdos-Renyi  random  graph.  For  this  reason,  we  need  to  narrow  down  the  conditions  on  p  and  0  as 
follows.  If  p  is  close  to  0  or  1,  then  the  graph  becomes  an  Erdos-Renyi  random  graph  with  edge  probability 
p  =  a  (when  p  «  1)  or  7  (when  p  ~  0).  Since  the  degree  distribution  of  the  Erdos-Renyi  random  graph  is 
binomial,  we  will  exclude  these  extreme  cases  of  p.  On  the  other  hand,  with  regard  to  0,  we  assume  that 
a  reasonable  configuration  space  for  0  would  be  where  is  between  1.6  and  3.  For  the  previous 

Kronecker  graph  example,  this  ratio  is  actually  about  2.44.  Our  approach  for  the  condition  on  0  can  be 
also  supported  by  real  examples  in  ETI.  This  condition  is  crucial  for  us,  since  in  the  analysis  we  use  that 

( ;A+(  1  -I07)  8rows  faster  than  the  polynomial  function  of  x.  If  is  close  to  1,  we  cannot  make 

use  of  this  fact.  Assuming  all  these  conditions  on  p  and  0,  we  result  in  the  following  theorem  about  the 
degree  distribution. 


Theorem  6.1  In  M(n ,  l,  // .  0)  that  follows  above  assumptions,  if 


(pa  +  (1  -  p)/3)p  (p/3  +  (1  -  p)y) 


1  -p 


> 


2  ’ 


4We  trivially  exclude  self-edges  not  only  because  computations  become  simple  but  also  because  other  models  usually  do  not 
include  them. 
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then  the  tail  of  degree  distribution,  pk,  follows  a  log-normal,  specifically, 
N  An  (n(pfi  +  (1  -  p)y)1)  +  Ip  In  R  +  M1"/*)^)2 


,  lp(  1  —  p)(\nR)‘ 


_  fia+( l-d)P 


as  n 


oo. 


for  R  — 

In  other  words,  the  degree  distribution  of  MAG  model  approximately  follows  a  quadratic  relationship  on 
log-log  scale.  This  result  is  nice  since  some  social  networks  follow  the  log-normal  distribution.  For  instance, 
the  degree  distribution  of  Live  Journal  network  looks  more  parabolic  than  linear  on  log-log  scale  OTl. 

In  brief,  as  the  expected  degree  is  an  exponential  function  of  the  node  weight  by  Lemma l33l  the  degree 
distribution  is  mainly  affected  by  the  distribution  of  node  weights.  Since  the  node  weight  follows  a  binomial 
distribution,  it  can  be  approximated  to  a  normal  distribution  for  sufficiently  large  l.  Because  the  logarithmic 
value  of  the  expected  degree  is  linear  in  the  node  weight  and  this  weight  follows  a  binomial  distribution,  the 
log  value  of  degree  approximately  follows  a  normal  distribution  for  large  l.  This  eventually  indicates  that 
the  degree  distribution  roughly  follows  a  log-normal. 


Note  that  there  exists  a  condition, 


l  p 


> 


which  is  related  to 


(pa  +  (1  -  p)PY  (p/3  +  (1  -  /lb)1  M 

the  existence  of  a  giant  component.  First,  this  condition  is  perfectly  acceptable  because  real-world  networks 
have  a  giant  component.  Second,  as  we  described  in  SectionlH  this  condition  ensures  that  the  median  degree 
is  large  enough.  Equivalently,  it  also  indicates  that  the  degrees  of  a  half  of  the  nodes  are  large  enough.  If  we 
refer  to  the  tail  of  degree  distribution  as  the  degrees  of  nodes  with  degrees  above  the  median  degree,  then 
we  can  show  Theoreml6.ll 

The  full  proofs  for  this  analysis  are  described  in  Appendix. 


7  Extensions:  Power-Law  Degree  Distribution 

So  far  we  have  handled  the  simplified  version  of  MAG  model  parameterized  by  only  few  variables.  Even 
with  these  few  parameters,  many  well-known  properties  of  social  networks  can  be  reproduced.  However,  re¬ 
garding  to  the  degree  distribution,  even  though  the  log-normal  is  one  of  the  distributions  that  social  networks 
commonly  follow,  a  lot  of  social  networks  also  follow  the  power-law  degree  distribution  I1311. 

In  this  section,  we  show  that  the  MAG  model  produces  networks  with  the  power-law  degree  distribution 
by  releasing  some  constraints.  We  do  not  attempt  to  analyze  it  in  a  rigorous  manner,  but  give  the  intuition 
by  suggesting  an  example  of  configuration.  We  still  hold  the  condition  that  every  attribute  is  binary  and 
independently  sampled  from  Bernoulli  distribution.  However,  in  contrast  to  the  simplified  version,  we  allow 
each  attribute  to  have  a  different  Bernoulli  parameter  as  well  as  a  different  attribute-attribute  affinity  matrix 
associated  wit  it.  The  formal  definition  of  this  model  is  as  follows: 

i 

P(a,j(u)  =  0)  =  Pj ,  P[u,v }  =  • 

3= 1 

The  number  of  parameters  here  is  41,  which  consist  of  p/s  and  @/s  for  j  =  1,  2,  •  ■  ■  .1.  For  convenience, 
we  denote  this  power-law  version  of  MAG  model  as  M(n,l,  p,Q)  where  p  =  {p\,---  ,  pi}  and  0  = 
{©i,  •  •  •  ,  0;}.  With  these  additional  parameters,  we  are  able  to  obtain  the  power  law  degree  distribution  as 
the  following  theorem  describes. 

Theorem  7.1  For  M(n,l,  p,Q),  if  y-f/-  =  )  for  $  >  ^ ien  the  degree  distribution 

satisfies  pk  oc  k~s~2  as  n  oo. 
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In  order  to  investigate  the  degree  distribution  of  this  model,  the  following  two  lemmas  are  essential. 
Lemma  7.2  The  probability  that  a  node  u  in  M (n,  l,  p.  0)  has  an  attribute  vector  a(u)  is 

l 

-  W)1KM=1} . 

2=1 

Lemma  7.3  The  expected  degree  of  node  u  in  M  (n,  l,  p.  0)  is 

i 

(n  -  1)  JJ  (piCti  +  (1  -  Pi)f3i)1{ai(u)=0}  {pifii  +  (1  -  pf)  . 

2=1 

By  Lemmas  17.21  and  1731  if  the  condition  in  Theorem  17.11  holds,  the  probability  that  a  node  has  the  same 
attribute  vector  as  node  u  is  proportional  to  (— <5)-th  power  of  the  expected  degree  of  u.  In  addition,  (— |)-th 
power  comes  from  the  Stirling  approximation  for  large  k.  This  roughly  explains  Theorem  17. 11 
The  proof  is  given  in  Appendix  and  the  result  is  also  verified  by  simulation  in  Figure  [5] 


8  Simulation 

In  the  previous  sections,  we  performed  theoretical  analysis  of  the  MAG  model.  In  this  section,  we  use  simu¬ 
lation  experiments  to  further  demonstrate  the  properties  of  networks  that  arise  from  the  MAG  model.  First, 
we  generated  synthetic  MAG  graphs  with  varying  parameter  values  to  explore  how  the  network  properties 
change  as  a  function  of  those  parameters.  We  focus  on  the  change  of  scalar  network  properties,  like  diameter 
and  the  fraction  of  nodes  in  the  largest  connected  component  of  the  graph,  as  a  function  of  the  model  param¬ 
eter  values.  Second,  we  also  ran  simulations  with  fixed  parameter  configurations  to  check  other  properties 
of  MAG  model  that  we  did  not  theoretically  analyze.  In  this  way,  we  are  able  to  qualitatively  compare  our 
model  to  a  real-world  network. 

8.1  MAG  model  parameter  space 

Here  we  focus  on  the  simplified  version  of  the  MAG  model  and  examine  how  various  network  properties 
vary  as  a  function  of  parameter  settings.  We  fix  all  but  one  parameter  and  vary  the  remaining  parameter.  We 
vary  p,  a,  f,  and  n  in  M(n,  l ,  p,  0),  where  a  is  the  first  entry  of  the  affinity  matrix  0  =  [a  /3;  ft  7]  and  / 
indicates  a  scalar  factor  of  0,  i.e.,  0  =  /  •  0o  for  a  constant  ©o  =  [on  fio ;  Po  70]. 

Figure  [3] depicts  the  number  of  edges,  the  fraction  of  nodes  in  the  largest  connected  component,  and  the 
effective  diameter  (the  90th-percentile  distance  |28ll )  of  the  network  as  a  function  p.  a,  f,  and  n  for  fixed 
l  =  8.  First,  we  notice  that  the  growth  of  network  in  the  number  of  edges  is  slower  than  exponential  since 
the  curves  on  the  plot  grow  sub-linearly  in  Figure  [3ja)  with  log  scaled  y-axis.  Note  that  the  network  size 
is  roughly  proportional  to  n 2  ( p2a  +  2// ( 1  —  p)/3  +  (1  —  p)2 7V  from  Theorem  13.11  For  example,  by  this 
formula,  the  network  size  is  proportional  to  the  Z-th  power  of  /,  i.e.,  the  eighth  power  of  /  in  this  case.  As  the 
expected  number  of  edges  is  a  polynomial  function  of  each  variable  (p,  a,  f  and  n),  this  sublinear  growth 
on  the  log  scale  agrees  with  our  analysis.  Furthermore,  the  larger  the  degree  of  the  polynomial  function  for 
each  variable  is,  the  closer  to  the  straight  line  the  network  size  curve  becomes.  For  instance,  the  network  size 
grows  by  the  polynomial  function  of  degree  16  over  p,  whereas  it  grows  by  degree  2  over  n.  In  Figure  [3} a), 
we  thus  observe  that  the  network  size  growth  over  p  is  even  closer  to  the  exponential  curve  than  that  over  n. 
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Second,  in  Figure  0b),  the  size  of  the  largest  component  shows  a  sharp  thresholding  behavior,  which 
indicates  a  rapid  emergence  of  the  giant  component.  This  is  very  similar  to  thresholding  behaviors  observed 
in  other  network  models  such  as  the  Erdos-Renyi  random  graphs  model  lfl4ll.  The  vertical  line  in  the  middle 
of  each  figure  represents  the  theoretical  theshold  for  the  unique  giant  connected  component.  As  we  analyzed, 
each  network  contains  at  least  half  size  of  giant  connected  component  at  its  threshold. 

Last,  while  the  previous  two  network  properties  monotonically  change,  in  Figure  [3}c)  the  effective 
diameter  of  the  network  increases  quickly  up  to  about  the  point  where  the  giant  connected  component  forms 
and  then  drops  rapidly  after  that  and  approaches  a  constant  value.  This  behavior  is  in  accordance  with 
empirical  observations  of  the  “gelling”  point  where  the  giant  component  forms  and  the  diameter  starts  to 
decrease  in  the  evolution  of  real-world  networks  12811331. 

Furthermore,  we  also  performed  simulations  where  we  fix  0  and  p  but  simultaneously  increase  both  n 
and  l  by  maintaining  their  ratio  constant.  Figure  [4] plots  the  change  in  each  network  metric  (network  size, 
fraction  of  the  largest  connected  component,  and  effective  diameter)  as  a  function  of  the  number  of  nodes  n 
for  different  values  of  p.  Each  plot  effectively  represents  the  evolution  of  the  MAG  network  as  the  number 
of  nodes  grows  over  time.  From  the  plots,  we  see  that  MAG  model  follows  densification  power  law  (DPL) 
and  the  shrinking  diameter  properties  of  real-world  networks  l(28l.  Depending  on  the  choice  of  p,  one  can 
control  the  rate  of  densification  and  the  diameter. 

8.2  Degree  Distributions 

In  addition  to  the  network  size,  connectivity,  and  diameter,  we  also  examined  the  degree  distributions  of 
MAG  graphs  empricially.  We  already  proved  that  the  MAG  model  can  give  rise  to  networks  that  have  either 
a  log-normal  or  a  power-law  degree  distribution  depending  on  the  model  parameters.  Here  we  generate  the 
two  versions  of  networks  and  compare  their  degree  distributions. 

Figure  [5]  exhibits  the  degree  distributions  of  the  two  types  of  MAG  model.  While  Figure  [3a)  plots  the 
degree  distributions  of  the  simplified  MAG  model  M(n,  Z,  /z,  0),  Figure  Ob)  shows  those  of  the  power- 
law  MAG  model  M(n,  l,  p,  0).  For  each  case,  the  left  plot  represents  the  raw  form  of  degree  histogram, 
whereas  the  right  curve  plots  the  complementary  cumulative  distribution  (CCDF),  which  nicely  removes 
the  noisy  factor.  In  Figure  [3a),  both  raw  and  CCDF  versions  of  distribution  look  parabolic  on  the  log- 
log  scale,  which  verifies  that  M(n,l,  p,Q)  has  a  log-normal  degree  distribution.  On  the  other  hand,  in 
Figure[3b),  both  plots  exhibit  the  straight  line  on  the  same  scale,  which  indicates  that  the  degree  distribution 
of  M(n,l,p ,  0)  follows  a  power-law.  All  these  experimental  results  agree  with  our  analyses  in  Section [6] 
and  Section  [3 

8.3  Comparison  to  Real-world  Networks 

Also,  we  qualitatively  compare  the  structural  properties  of  a  specific  real-world  network  and  the  corre¬ 
sponding  MAG  model.  This  leads  to  interesting  questions  of  how  to  find  optimal  MAG  model  parameters 
so  that  synthetic  network  resembles  the  given  real-world  network.  The  full  resolution  of  these  questions 
lies  beyond  the  scope  of  the  present  paper;  currently,  we  searched  by  brute  force  over  (the  relatively  small 
number  of)  possible  MAG  parameter  settings.  We  manually  selected  some  parameter  settings  (for  n,l,  p,  0) 
to  synthesize  the  simplified  MAG  model  and  obtained  the  properties  of  M(n.  I,  p.  0)  to  compare  the  MAG 
model  with  a  real-world  network.  Our  goal  is  not  to  claim  that  these  particular  parameter  values  are  in  any 
way  “optimal”  for  the  given  real-world  network  but  rather  to  show  that  many  properties  of  the  MAG  model 
exhibit  qualitatively  similar  behavior  as  real-world  networks. 
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Figure  3:  Structural  properties  of  a  simplified  MAG  model  M(n ,  l,  fi.  0)  when  we  fix  l  and  vary  a  single 
parameter  one  by  one:  /i,  a ,  /,  or  n.  As  each  parameter  increases,  in  general,  the  synthetic  network  becomes 
denser  so  that  a  giant  connected  component  emerges  and  the  diameter  decreases  to  approach  a  constant. 
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Figure  4:  Structural  properties  of  a  simplified  MAG  graph  as  a  function  of  the  number  of  nodes  n  for 
different  values  of  p  (we  fix  the  affinity  matrix  0  =  [0.85  0.7;  0.7  0.15]  and  the  ratio  p  =  l /  log  n  =  0.596). 
Observe  not  only  that  the  relationship  between  the  number  of  edges  and  nodes  obeys  Densification  Power 
Law  but  also  that  the  diameter  begins  shrinking  after  the  giant  component  is  formed  lf33ll. 
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(a)  Simplified  MAG:  M(n,  l,  n,  0).  Log-normal  degree  distribution. 


(b)  Power-law  MAG:  M(n,  l,  /2,  0).  Power-law  degree  distribution. 


Figure  5:  Degree  distributions  of  simplified  and  power-law  version  of  MAG  graphs  (see  Section [7]).  We  plot 
both  PDF  and  CCDF  of  the  degree  distribution.  The  simplified  version  in  Figure  (a)  has  parabolic  shape  on 
log-log  scale,  which  is  an  indication  of  a  log-normal  degree  distribution.  In  contrast,  the  power-law  version 
in  Figure  (b)  shows  a  straight  line  on  the  same  scale,  which  demonstrates  a  power-law  degree  distribution. 
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For  the  real-world  network,  we  use  the  Yahoo  I-Flickr  online  social  network  on  10,240  nodes  and  44,800 
edges.  For  the  simplified  MAG  model  M(n,  l ,  fi,  0),  we  used  l  =  8,p  =  0.45,  0  =  [0.85  0.30;  0.30  0.25] 
with  the  same  number  of  nodes  n  =  10,  240.  Figure  Oa)  and  (b)  illustrate  the  following  properties  of  the 
real-world  and  the  corresponding  synthetic  network  of  the  simplified  MAG  model  (in  the  same  order  of 
figures). 

•  Degree  distribution  is  a  histogram  of  the  number  of  edges  of  a  node  fl5l. 

•  Singular  values  indicate  the  singular  values  of  the  adjacency  matrix  versus  their  rank  fl6l. 

•  Singular  vector  represents  the  distribution  of  components  in  the  left  singular  vector  associated  with 
the  largest  singular  value  lU2l. 

•  Clustering  coefficient  represents  the  degree  versus  the  average  (local)  clustering  coefficient  of  nodes 
of  a  given  degree  ll43l. 

•  Triad  participation  indicates  the  number  of  triangles  that  a  node  is  adjacent  to.  It  measures  the  transi¬ 
tivity  in  networks  11411 . 

•  Hop  plot  shows  the  number  of  reachable  pairs  of  nodes  as  the  number  of  hops.  It  sketches  how  quickly 
the  network  expands  Il40il27l. 

Figure  [6]  reveals  that  the  plots  of  properities  of  MAG  model  resemble  those  of  Yahoo  !-Flickr  network. 
Notice  qualitatively  similar  behavior  of  nearly  all  properties  between  Figure  0 a)  and  (b).  The  only  property 
where  the  simplified  MAG  model  does  not  match  the  Yahoo  !-Flickr  network  seems  to  be  the  clustering 
coefficient.  As  in  real-world  networks  high  degree  nodes  tend  to  have  lower  clustering,  in  the  simplified 
MAG  model  the  situation  is  reverse  -  higher  degree  nodes  also  tend  to  have  higher  clustering.  This  is 
due  to  the  fact  that  for  all  attributes  we  use  the  same  affinity  matrix  0  which  represents  only  the  core- 
perphery  structure  (a  >  (5  >  7).  Thus,  the  simpified  MAG  model  can  only  resemble  the  overall  core¬ 
periphery  shape  of  real-world  networks.  However,  in  the  Yahoo  !-Flickr  network,  we  can  also  discover  the 
local  clustering  effect  of  homophily  and  network  community  formation,  which  views  the  network  in  the 
opposite  way  compared  to  the  global  core-periphery  structure. 

Hence,  our  hypothesis  is  that  the  local  clustering  of  nodes  would  naturally  emerge  by  mixing  core¬ 
periphery  affinity  matrices  (a  >  f3  >  7)  and  homophily  affinity  matrices  (a,  7  >  fi).  To  investigate  this,  we 
also  generated  the  synthetic  network  with  more  general  version  of  MAG  model,  M(n,  l,  jl.  0).  Figure  [6jc) 
illustrates  the  network  properties  of  this  general  version.  Note  that  this  general  version  of  the  model  nicely 
captures  the  heavy-tailed  cluestering  coefficient  distribution  that  the  real-world  network  shows  while  the 
simplified  version  cannot.  For  the  other  properties,  the  general  version  still  exhibits  distributions  which 
qualatatively  seem  similar  to  those  of  the  real-world  network. 

By  this  experiment,  we  can  find  that  MAG  model  is  capable  of  representing  real-world  networks.  Fur¬ 
thermore,  we  verify  the  flexibility  of  MAG  model  in  a  sense  that  it  can  give  rise  to  networks  with  different 
network  properties  depending  on  the  MAG  model  parameter  configuration. 

9  Conclusion 

We  presented  the  Multiplicative  Attribute  Graph  model  for  real-world  networks  which  considers  the  cate¬ 
gorical  node  attributes  as  well  as  the  affinity  of  link  formation  depending  on  the  values  of  node  attributes. 
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(a)  Yahoo  !-Flickr  network  (b)  Simplified  MAG  model  (c)  General  MAG  model 


Figure  6:  The  comparison  of  network  properties  between  real-world  Yahoo  FFlickr  online  social  network, 
a  simplified  MAG  model  network,  and  a  general  version  of  MAG  model.  Except  for  clustering  coefficient, 
the  properties  of  MAG  model  qualitatively  resemble  those  of  the  Yahoo  I-Flickr  network  even  when  it  is 
the  simplified  version  in  Figure  (b).  Moreover,  the  general  version  of  the  MAG  model  can  represent  all  six 
network  properties  of  similar  shape  to  real-world  networks  in  Figure  (c). 
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We  introduced  the  attribute-attribute  affinity  matrix  to  represent  the  affinity  of  link  formation  and  provide 
the  flexibility  in  the  network  structure. 

On  the  other  hand,  the  MAG  model  is  both  analytically  tractable  and  statistically  interesting.  In  this 
paper,  we  analytically  showed  several  network  properties  observed  in  real-world  networks.  We  proved  that 
the  MAG  model  obeys  the  Densification  Power  Law.  We  also  showed  both  the  existence  of  unique  giant 
connected  component  and  a  small  diameter  in  the  MAG  model.  Furthermore,  we  mathematically  analyzed 
that  the  MAG  model  give  rise  to  networks  with  either  a  log-normal  or  a  power-law  degree  distribution. 
Finally,  we  emprically  verified  our  analytical  results. 

The  MAG  model  is  statistically  interesting  in  a  sense  that  it  can  represent  various  types  of  network 
structure  as  well  as  lead  a  problem  that  aims  to  find  such  structures  of  the  given  real-world  networks  in 
terms  of  the  MAG  model  parameters.  However,  we  leave  the  parameter  fitting  problem  as  a  venue  of  the 
future  work.  Furthermore,  future  work  includes  other  kinds  of  problems  such  as  how  to  find  underlying 
network  structures  and  missing  node  attributes  where  node  attributes  are  partially  observed. 
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A  Appendix:  The  Number  of  Edges 


Proof  of  Lemma  l3T2t  Let  N®v  be  the  number  of  attributes  that  take  value  0  in  both  u  and  v.  For  instance,  if 
a(u)  =  [0010]  and  a(v)  =  [0110],  then  N®v  =  2.  We  similarly  define  N^v  as  the  number  of  attributes 
that  take  value  1  in  both  u  and  v.  Then,  N°v,  A\\v  >  0  and  N°v  +  N*v  <  l  as  l  indicates  the  number  of 
attributes  in  each  node. 

By  definition  of  MAG  model,  the  edge  probability  between  u  and  v  is 

nr  l  N°  ol—N°  —N1  N1 

l\U  V\  -  (X  uv  jj  UV 

Since  both  N®v  and  N^v  are  random  variables,  we  need  their  conditional  joint  distribution  to  compute 
the  expectation  of  the  edge  probability  P[u,v]  given  the  weight  of  node  u.  Note  that  N®v  and  N^v  are 
independent  of  each  other  if  the  weight  of  u  is  given.  Let  the  weight  of  u  be  i,  i.e.,  u  €  W{.  Since  u  and 
v  can  share  value  0  only  for  the  attributes  where  u  already  takes  value  0,  N®v  equivalently  represents  the 
number  of  heads  in  i  coin  flips  with  probabiltiy  //.  Therefore,  N®v  follows  Bin(i ,  //) .  Similarly,  Nf.  follows 
Bin(l  —  1,1  —  p).  Hence,  their  conditional  joint  probability  is 

f l  —  P 


P(Kv,N^v\ueWi)  = 


N° 

1  UV 


pN°v(l  —  p) 


i—N° 


N1 

uv 


.l—i—N},, 


P 


(i-M) 


NL 


Using  this  conditional  probability,  we  can  compute  the  expectation  of  P[u,  v]  given  the  weight  of  u : 

E  [P  [u,  v }  | u  €  Wi]  =  E  u  €  Wi 


no  )  Cm MN°V  (U  -  pW  (pP)1  i  NL  (U  -  p)i)nL 

N°  ==0  N1  =0  K  uv' 

UV  w  UV 


E  E 


E 


’  )  {up)N™  ((i  -  p)py  N°v 

N°  =0  x  uv' 

1  '  11  1! 


E  Cmi')  W)‘— "”((!- «)t)' 

ni  =0  \lyuv. 


=  {pa  +  (1  -  p)/3f  {p/3  +  (1  -  p)y) 


l—i 
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Proof  of  Lemma  l33l  By  Lemma [3721  and  the  linearity  of  expectation,  we  sum  this  conditional  probability 
over  all  nodes  and  result  in  the  expectation  of  the  degree  given  the  weight  of  node  u.  ■ 


Proof  of  Theorem  I3.lt  We  compute  the  number  of  edges,  E  [m\,  by  adding  up  the  degrees  of  all  nodes 
described  in  Lemma [331 


E  [m]  =  E 


\  Y  des(u) 


2 

L  uev 


-nYP{W,)ndeg{u)\u  £Wj\ 


j= o 


3=0 


nY  (  ^  ^  ~  ^  [deg(u)\u  €  Wj] 


\nY(1-)  ((n  “  !)  +  (!  -  fl)P)3  (vP  +  (!  -  Vh)1  3  (1  -  ^)l 

3=0  W 

n(yU0  ^  (^2«  +  2ju(l  -  n)P  +  (1  -  m)27)Z  +  n  (jua  +  (1  -  n)if  . 


Proof  of  Corollary  13.3. It  Suppose  that  l  =  logn  for  £  =  fi2a  +  2/r(l  —  fj,)/3  +  (1  —  /i)27  and 

e  >  0.  By  Theorem  13.  li  the  expected  number  of  edges  is  0  (n2(l).  Note  that  logC  <  0  since  £  <  1. 
Therefore,  the  expected  number  of  edges  is 

0(nV)  =  0  (V+^)  =  0(n1+elogC)  =  o(n) . 


Proof  of  Corollary  13.3.21  Under  the  situation  that  l  €  o(log  n),  the  expected  number  of  edges  is 

0(n2CZ)  =  0(n2+(l^)log<)  =  0(n2+o(1)logC)  =  0(n2~o(1)) . 


B  Appendix:  Connectivity 

Since  Theoreml4.3lis  used  to  prove  other  theorems,  we  begin  with  the  proof  of  it. 

Proof  of  Theorem  14.31  If  j  >  i,  for  any  v  G  W; ,  we  can  generate  a  node  7  Wj  from  v  by  flipping 
(j  —  i)  attribute  values  that  originally  take  1  in  v.  For  example,  if  a(v)  =  [0110],  then  a(y^)  =  [0010] 
or  [0  1  0  0].  Hence,  P[u,  v^]  >  P[u,  v\  for  v  €  Wj. 
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Here  we  note  that  E  [P  [it,  i/-^]  |n  €  Wi]  =  E  [P  [  it,  |i;W)  £  Wj\ ,  because  each  can  be  gen¬ 

erated  by  (■[)  different  a(v)  sets  with  the  same  probability.  Therefore, 


E  [P  [it,  v]  |n  €  Wj\  =  E 


E 


P 


it,  n 


(j) 


In  €  Wi 


>  E  [E  [P  [ it ,  n]  |n  €  Wi]]  =  E  [P  [it,  v]  \v  €  Wf] 


Next  theorem  plays  a  key  role  in  proving  Theorem  14.  lias  well  as  Theorem  14.21 

Theorem  B.l  Let  \  Sj  \  £  0(n)  and  E  [P  [u,  V\it]  |it  £  Wj]  >  clog  nasn  ^  oo  for  some  j  and  sufficiently 
large  c.  Then,  Sj  is  connected  with  high  probability  as  n  — >•  oo. 

Proof:  Let  S'  be  a  subset  of  Sj  such  that  S'  is  neither  an  empty  set  nor  Sj  itself.  Then,  the  expected  number 
of  edges  between  S'  and  Sj\S'  is 

E  [P  [S',  Sj\S ']  |  |S'|  =  k]  =  k  •  (|Sj|  -  k)  •  E  [P  [it,  v\  \u,  v  £  Sj] 

for  distinct  u  and  v.  By  Theorem  14.31 

E  [P  [it,  v]  |it,  v  £  Sj]  >  E  [P  [it,  v\  |u  £  Sj,  v  £  V] 

>  E  [P  [it,  v]  [it  £  Wj,v  €  H\it] 
clogn 


> 


n 


Given  the  size  of  S'  as  k,  the  probability  that  there  exists  no  edge  between  S'  and  Sj\S'  is  at  most 
exp  (— ^E  [P  [S',  Sj\S']  1 1 S' |  =  A;])  by  Chernoff  bound.  Therefore,  the  probability  that  Sj  is  disconnected 
is  bounded  as  follows: 

P(Sj  is  disconnected)  <  P( no  edge  between  S',  Sj\S ') 

S'cS^S'^Sj 
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E  exp[-iE[P[P',5i\S,]||5'|]) 

?  Qf-A0\  Q.  V  / 


S'cS,S'^,S. 


<  E  exP  ( -\s’\  (I'S’il  -  \s'\) 

S'cS,S'^(/>,Sj 


(/  s  clogn 


2  n 
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2  E 


l<k<\Sj\/2 


i^i)  exp  LdSTp* 
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c|Sj|  logn 
4n 
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<2  E  expfYlog|  Sj 

t<k<\Sj\/2  ^  ^ 

=  2  E  exp  (— /c0(logn))  (v  |5j|  £  0(n)) 

1  <  fc  <  |  S'j  |  /  2 

/i  \k 

=  2  E 


\<k<\Sj\/2 
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©(I) 


n0(!) 
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as  n  oo.  Therefore,  Sj  is  connected  with  high  probability. 


Now  we  turn  our  attention  to  the  giant  connected  component.  To  show  its  existence,  we  investigate 
■$'///+/ 1  /e ,  and  S^l+l2/3  depending  on  the  situation.  The  following  lemmas  tell  us  the  size  of  each  subgraph. 


Lemma  B.2  | S^j  |  >  f  ~  o(n)  with  high  probability  as  n  — >•  oc. 

Proof:  By  Central  Limit  Theorem,  -4=  =  ~  iV(0, 1)  as  n  — >•  oo,  i.e.,  I  — >  oo.  Therefore,  P(\u\  >  pi)  is 

VW  i-m) 

at  least  \  —  o(l)  so  1 5)^ |  >  f  —  o(n)  with  high  probability  as  n  — >•  oo.  ■ 


Lemma  B.3  |S/Jj!+ii/6 1  G  0(n)  with  high  probability  as  n  — >•  oc. 

Proof:  By  Central  Limit  Theorem  mentioned  in  Lemma lfL2l 

j  1/6 

P(pl  <  |it|  <  pi  +  l1'*)  «  <&(  )  -  <3?(0)  €  o(l) 

VW1  ~  b) 

as  l  — >•  oo  where  #(z)  represents  the  cdf  of  the  standard  normal  distribution. 

Since  P(|it|  >  pi  +  Z1/6)  is  still  at  least  ^  —  o(l),  the  size  of  S  l+p/ 6  is  0(n)  with  high  probability  as 
l  — >•  oo,  i.e.,  n  — >•  oo.  ■ 


Lemma  B.4  |S/j;+i2/3 1  €  o(n)  with  high  probability  as  n  — >•  oo. 

Proof:  By  Chernoff  bound,  P(|it|  >  pl+l 2/3)  iso(l)  as  /  — »•  oo,thus  |S)ti+;2/3|  is  o(n)  with  high  probability 
as  n  — >•  oo.  ■ 

Using  the  above  lemmas,  we  show  the  existence  and  the  uniqueness  of  the  giant  connected  component  under 
the  given  condition. 

Proof  of  Theoreml4.lt 


(Existence).  First,  if 


(pa  +  (1  —  p)/3)^  (p/3  +  (1  —  p)'})1  M  >  7T.  then  by  Lemmal3.31 


E[P[u,V\u]\uGW^} 


(pa  +  (1  -  p)PY  (pi 3  +  (1  -  p) 7) 


1— M 


-  p~  log  n 


=  (1  +  e)logn  >  clog 


n 


for  some  constant  e  >  0  and  c  >  0.  Since  [SL;|  €  0(n)  by  Lemma  IB.2I  S/j  is  connected  with  high 
probability  as  n  — >•  00  by  TheoremlB.il  In  other  words,  we  are  able  to  extract  out  a  connected  component 
of  size  at  least  j  —  o(n). 


Second,  when 


(pa  +  (1  —  p)PY  ( pf3  +  (1  —  p) 7)1  M  =  4  we  can  apply  the  same  argument  for 
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S/d+li/e.  Because  |S  j+ji/6|  G  0(n)  by  Lemma  HOI 


E 


P  [u,  VV]  | u  G  W^+ii/6 


{na  +  (1  -  /i)/ 3)M  (/i/3  +  (1  -  /i)7) 


l-M 


-  pi  log  n 


_  //i«+(i-/i)/3ypiogn)1/6 

V/i/3  +  (l-/i)7/ 

=  (l  +  eyi°g^1/6 


/  /iQ+  (l  -  /i)/3 
V/i/3  +  (!  -/i)7 


(plogn)1/6 


which  is  also  greater  than  clogn  as  n  — »■  oo  for  some  constant  e'  >  0.  Thus,  S^l+l i/6  is  connected  with 
high  probability  by  Theorem  IB.  II 

Last,  on  the  contrary,  when  (//a  +  (1  —  /i)/3)M  (/i/3  +  (1  —  /i)7)1_M  <  |,  for  u  G  W^+p/3, 


E 


P  [«,  V\u]  \u  G  W^l+l2/3 


(fiat +  (1-  /i)/3)M  (/i/3  +  (1  -  /i)7) 


t-p 


i  p  i 


losn  /  fxa  +  (1  —  /i)/3\ 
/i/3  +  (1  -/i)7/ 


(plogn)2/3 


(1  -  e*)' 


p— 2/3(i0gn)1/3  ^  fj,a  +  (1  —  /i)/3 
/i/3  +  (1  -  /i)7 


(plogn)2/3 


is  o(l)  as  n  — >•  oo  for  some  constant  e"  >  0.  Therefore,  by  Theorem  14. 3 1  the  expected  degree  of  a  node  with 
weight  less  than  /iZ  + Z2/ 3  is  o(l).  However,  since  S/ll+l2/3  is  o(n)  by  Lemma  lB~4l  n  —  o{n )  nodes  have  less 
than  /iZ  +  Z2/3  weights.  Hence,  most  of  n  —  o(n)  nodes  are  isolated  so  that  the  size  of  the  largest  component 
cannot  be  0(n). 


(Uniqueness).  We  already  pointed  out  that  either  S^i  or  S  i/e  is  the  subset  of  0(n)  component  when 
the  giant  connected  component  exists.  Let  this  component  be  H.  Without  loss  of  generality,  suppose  that 
,5’/(/  C  //.  Then,  for  any  fixed  node  u, 

P[u,H]>P[u,S ^  (v^Cfl) 

=  |S'M/|  •  E  [P  [ii,n]  |n  G 

>  |5^|  •  E  [P  [u,  n]  \v  G  (By  Theorem  |4"3]) 

Since  U\P  c 

E  [P  [u,  U\P]]  <  E  [P  [u,  E  [«,  #  ]] 

holds  for  every  mGT 
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Suppose  that  another  connected  component  H'  also  contains  0(n)  nodes.  We  will  show  the  condtradic- 
tion  if  H  and  H'  are  not  connected  with  high  probability  as  n  — »  oo.  To  see  E  [ P  [ H ,  H']\, 

E  [P  [ H ,  H']\  =  \H'\  •  E  [P  [u,  H]  | u  €  H'] 

>  [- P  [«,  H']  \u  €  H']  (•_•  if'  C  C  V\S^) . 

However,  E  [P[u,Hr]  \ u  €  H']  €  H(l).  Otherwise,  since  the  probability  that  u  €  H'  is  connected  to 
H'  is  not  greater  than  E  \P  \u.  H'\  u  €  H']  by  Markov  Inequality,  u  is  disconnected  from  H'  with  high 
probability  as  n  oo.  H'  thus  includes  at  least  one  isolated  node  with  high  probability  as  n  oo.  This  is 
contradiction  to  the  connectedness  of  H' . 

On  the  other  hand,  if  E  [P  [it,  H']  \u  €  H']  €  0(1),  then  E  [P  [H,  H'}\  €  0(n).  In  this  case,  by  Chernoff 
bound,  //  and  IV  are  connected  with  high  probability  as  n  —>  oo.  This  is  also  contradiction.  Therefore, 
there  is  no  0(n)  connected  component  other  than  H  with  high  probability  as  n  — >•  oo.  ■ 

Next,  the  proofs  for  the  connectedness  follow.  Before  the  main  proof,  we  present  some  necessary  lemmas 
and  prove  them. 


/ 1  \ L—X 

Lemma  B.5  (^)  ( -j— )  is  a  monotonically  increasing  function  of  x  over  (0,  /;). 

Proof:  Let  f(x)  be  the  log-value  of  the  given  function,  i.e., 

f(x)  =  x  (log  /i  —  log  x)  +  (1  —  x)  (log  (1  —  y)  —  log  (1  —  x))  . 

To  take  the  derivative  of  f(x), 

f'{x)  =  (log  n-  log  x)  +  (log  ( 1  -  x)  -  log  (1  -  fj,))  . 

Since  x  <  fi  and  1  —  //  <  1  -  x,  fix)  >  0  where  0  <  x  <  /i.  This  implies  that  f(x)  is  strictly  increasing, 
so  the  given  function  is  also  strictly  increasing  over  (0,  /r).  ■ 


Lemma  B.6  If  (1  —  /i)1’  >  then  Vmhi  o  with  high  probability  as  n  — >•  oo.  Otherwise,  if  { 1  —  p)p  < 
jj  with  high  probability  as  n  oc  where  u  is  a  solution  of  the  equation 

in  (0,  p). 

Proof:  First,  we  assume  that  (1  —  p)p  >\  ,  which  indicates  n(l  —  p)p  >  1  by  defition.  Then,  the  probability 
that  |  Wf  =  0  is  at  most  exp(— -yE  [|Wi|])  by  Chernoff  bound.  However,  for  fixed  //, 

E  [|Wi|]  =  nQ/(  1  -  p)1-1  >  G  0(1) . 

Therefore,  by  Chernoff  bound,  P(|VFi|  =  0)  — »  0  as  l  — >•  cx).  This  implies  that  V(nin  is  o(l)  with  high 
probability  as  n  oo. 
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Second,  we  look  at  the  case  that  (1  —  y)p  <  Jj.  For  any  e  €  (0,  y  —  v),  to  use  Stirling’s  approximation, 

E  [\W(u+e)l\]  «  j  ^^(1  -  F)(1-("+e))Z 

^ _ 

+  e)Z(^)("+e)V27r(l  -  (1/  +  e))  ((lzkt£lli) (1~("+6)); 

n 

y/2 irl(u  +  e)  (1  -  (l>  +  e)) 


lJ- 


v  +  e 


v+t 


1  “F 


1-  U'  +  e 


l-(H-e)' 


Since  (£) 


/£^  f  1— /X 
1— X 


1 — cc 


is  a  increasing  function  of  x  over  (0,  /i)  by  Lemma [B31 


1\  1//p 


'Ir,-1/* 


for  some  constant  e'  >  0.  Therefore, 

E[\W(u+e)l\]  = 


(1  +  0 


/\i 


y/2Trl(u  +  e)  (1  -  {v  +  e)) 


exponentially  increases  as  l  increases.  By  Chernoff  bound,  |  W(v+e\i  |  is  not  zero  with  high  probability  as 
l  — >•  oo,  i.e.,  n  — >•  oo. 

In  a  similar  way,  E  [|FF(iy_e);|J  =  — ^  =  v  ==  ^  ^  exponentially  decreases  as  l  increases.  Since 

IE  [\Wi\]  >  E  \\Wj\]  if  yl  >  i  >  j,  the  expected  number  of  nodes  with  at  most  weight  (v  —  e)l  is  less  than 
{y  —  e)l  E  [|l4,r(jy_f)/|]  and  its  value  goes  to  zero  as  l  — >•  oo.  Hence,  by  Chernoff  bound,  there  exists  no  node 
of  the  weight  less  than  (y  —  e)l  with  high  probability  as  n  oo. 


To  sum  up,  goes  to  v  with  high  probability  as  l  — >•  oo,  i.e.,  n 


oo. 


Using  the  above  lemmas,  we  show  the  condition  that  the  network  is  connected. 
Proof  of  Theorem  14.21  Let  — >•  t  for  a  constant  t  €  [0,  y)  as  n  — >•  oo. 


If 


t/..o  ,  ( i  t  >  1.  by  Lemma [331 


(ya  +  (1  -  y)P)  (y/3  +  (1  -  y)j) 

E  [P  [ u ,  V\u]  |n  €  WVain]  «  E  [P  [u,  V\u]  | u  €  Wtl] 


(ya  +  (1  -  y)/3f  {y(3  +  (1  -  y)j) 


t-t 


p-  log  n 


=  (1  +  e 
>  c  log  n 


for  some  e  >  0  and  sufficiently  large  c.  Note  that  SVmin  indicates  the  entire  network  by  definition  of  I4iin- 
Since  |SVmin|  is  0(n),  Symin  is  connected  with  high  probability  as  n  — >•  oo  by  Theorem  IB.  II  Equivalently, 
the  entire  network  is  also  connected  with  high  probability  n  oo. 
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vmin  *'  vmin  -i 

On  the  other  hand,  when  (pa  +  (1  —  p)(5)  losn  (/i/3  +  (1  —  p)y)  losn  <  -,  the  expected  degree  of  a 
node  with  |  I/min|  weight  is  o(l)  because  from  the  above  relationship  E  [P  [it,  V'\?/]  | u  €  Wym in]  ~  (1  — 
e')logn  for  some  e'  >  0.  Thus,  in  this  case,  some  node  in  Wymin  is  isolated  with  high  probability  so  the 
network  is  disconnected.  ■ 


C  Appendix:  Diameter 

Theorem  C.l  W  For  an  Erdos-Renyi  random  graph  G(n,p),  if  (pn)d  1  /n  — >•  0  and  ( pn)d/n  — >•  oo 
for  a  fixed  integer  d,  then  G(n.  p)  has  diameter  d  with  probability  approaching  1  as  n  — >•  oo. 


Proof  of  Lemma  15.31  Let  AG  and  A 11  be  the  probabilistic  adjacency  matrix  of  random  graphs  G  and  H, 
respectively.  If  AG  >  A^  for  every  i,j  and  H  has  a  constant  diameter  with  high  probability,  then  so  does 
G.  It  can  be  understood  in  the  following  way.  To  generate  a  network  with  AG,  we  first  generate  edges  with 
Ah  and  further  create  edges  with  {A(:  —  A11).  However,  as  the  edges  created  in  the  first  step  already  result 
in  the  constant  diameter  with  high  probability,  G  has  a  constant  diameter. 

Note  that  minU)„esAi  P[u,  v]  >  /3A/7'1  ~X)I.  Thus,  it  is  sufficient  to  prove  that  the  Erdos-Renyi  random 
graph  G(|<S\/|,  has  a  constant  diameter  with  high  probability  as  n  oc.  However, 


E[|fLAi|]/3AV1_A)Z 


iiM(i  _ 


n  ( b£\  Xl  ( (!  ~  bh\ (1  X)l 

^2-jtIX  (1  -  A)  V  A  /  V  1  -  A  / 


n 

y/2irl\(l  -  A) 
1 

y/2irl\(l  -  A) 
1 

^/271-ZA  (1  -  A) 


(/r/3  +  (1  -  p)y)1  (■:  A  = 

(1  +  e)logn 


(By  Stirling  approximation) 

/r/3  +  (1  -  p)  Y 


for  some  e  >  0. 

Since  this  value  goes  to  infinity  as  n  ->  oo,  so  does  E  (1TA/].  Therefore,  by  Chernoff  bound,  \W\j\  > 
cE  [W\i]  with  high  probability  as  n  — »  oo  for  some  constant  c.  Then, 


|5az|/3az7(1“A)Z  >  |WaZ|/3AZ7(1_A)Z 

>  cE[|WA;|]/3AZ7(1-A)i 

«  -  =  =(i  +  e)log" 

v/2vr/A(l  -  A) 


By  Theorem  1C.  11  an  Erdos-Renyi  random  graph  G(|5A/|,  — c^1,+e^  S  )  has  a  diameter  of  at  most 

|SAi|  y : 2irl\(l-\) 

(l  +  with  high  probability  as  n  — >  oo.  Thus,  the  diamters  of  G(|5A;|, /3AZ7^1_A^)  as  well  as  S\i  are 
also  bounded  by  a  constant  with  high  probability  as  n  — >•  oo.  ■ 
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Proof  of  Lemma  153}  For  any  u  €  V, 


PfaSu]  >  ]T  n(l)  1-^(1  ~  p)l~jPjil~j 


j=\i 

i 


j=\i 


Owi  _  ((1  -  th 


£"L  Ft'-*)' 


a;  V  1  —  A 


i-j 


AJ(i-A)'  J  (/x/3  +  (1  -  /ib)' 

,  —  \ 7  / 


j=xi 

=  (2(pP  +  (l  -M)7)P)l0gn 


£(>(i-a)' 

U=At 


-J 


By  Centeral  Limit  Theorem,  Y?j=\i  (j)AJ(  1  —  A/--5  converges  to  ^  as  l  — »  oo.  Therefore,  P[it,  S\;]  is 
greater  than  c  log  n  for  a  constant  c,  and  then,  by  Chernoff  bound,  u  is  directly  connected  to  S\i  with  high 
probability  as  n  — >■  oo.  ■ 


D  Appendix:  Degree  Distribution 

Theorem  D.l  Mi/  P  (deg(u)  =  k)  =  fueV  (V)  (E  [P  [ u ,  v]})k  (1  -  E  [P  [u,  v J])”-1-*  du  . 

Corollary  D.1.1  For  Ej  =  {pa  +  (1  -  p)P)3  {pP  +  (1  -  p)j)1  \ 

the  probability  of  degree  k  in  M(n,l ,  p,  0)  is  Pk  =  o  {j)^0-  —  h)l~3  (1  —  Ej)n~l~k  . 

Proof:  To  reformulate  TheoremlD.il 

P  (deg(u)  =  k)  =  p(u  G  Wj)  ~  ^  (E  [P  [u,v]  \u  €  Wj])k  (1  -  E  [P  [u,v]  \u  6  Wj])n~1~k  . 
j=o  '  ' 

Therefore,  by  applying  Lemma [3721  we  obtain  the  desired  formula.  ■ 

Proof  of  Theoreml6.ll  To  reduce  the  space,  we  begin  by  defining  some  notations  as  follow: 

x  =  pa  +  (1  —  p)P 

y  =  pP  +  (i  -  f)i 

fj(k)  =  ^  k  ^  ( x3y ^  (l  -  xJyl~ 

9j(k )  =  -  p)l~3fj(k ) . 

By  Corollary  ID.  1.11  we  can  restate  pk  as  Y^j=o9j(k)- 

If  most  of  those  terms  turn  out  to  be  insignificant  under  our  assumptions,  the  probability  pk  can  be 
approximately  proportional  to  one  or  few  dominant  terms.  In  this  case,  what  we  need  to  do  is  thus  to  seek 
for  j  that  maximizes  gj  ( k )  =  (/)  //A  ( 1  —  p)1  ffj  (k)  and  find  its  approximate  formula. 
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We  start  with  the  approximation  of  fj(k).  For  large  n  and  k,  by  Stirling  approximation, 

-v/2vrn(n/e)n  ( x3yl~3)k  (l  —  x3yl~3)n  k 
^  y/2Trk(k  /  e)k  2ir(n  —  k)  ((n  —  k)/e)n~k 

1  /  nx3yl~3  \  k  / 1  —  x3yl~3  \nk 

sj2nk{  1-|)  V  k  Ml  -kjn  ) 


However,  the  expected  degree  of  maximum  weight  node  is  0(n  (jig  +  (1  —  y)/3)1),  so  is  the  expected 
maximum  degree,  k  is  thus  o(n)  with  high  probability  as  n  —>  oo,  i.e.,  I  — >•  oo. 


/ 1  —  x3yl  3 
\  1-  k/n 


n—k 


exp 


k)x3yl  3  + (n 


exp  (—nx3yl  3  +  k) . 


For  sufficiently  large  l,  we  can  further  simplify  gj(k )  by  normal  approximation  of  the  binomial  distribu¬ 
tion: 


In 9j{k)  =  In  ^y3(l  -  y)1  3  +  In  fj(k) 


-1-  In  (2^(1  ~  A)  ~  Uu(L 


C- 


(j-y  0  +1  n/j(fc) 
k 


1  1 

— - — 7 - -  (j  —  nl)2 - In  k  —  A:  In - — : 

2ly(l  —  y)  2  nx3yl~3 


+  k  1  - 


nx3yl  3 


for  some  constant  C.  When  k  =  nxTy  r  for  r  >  yl  and  R  = 


In gj{k)  «  C  - 


(j  —  yl)2 - In  A;  +  k(j  —  t)  In  R  +  k  (l  —  R3  T) 


2ly(l  —  y)  w  r  2 
Using  (j  -  yl )2  =  (j  -  r)2  +  (r  -  yl )2  +  2(j  -  r)(r  -  yl ), 

In pj (A;)  k,  CT  —  -  +  (j  —  r)  (^A:  In i?  —  T  ^ 


2//r(l  —  /r) 


f//(l  -/i) 


^  +A:(l-f?J-r)  -  ilnA: 


for  CT  =  C- 


U-AtQ2 


2///(l— //) ' 

Considering  ^  (A;)  as  a  function  of  j,  not  A:,  now  we  find  j  that  maximizes  gj  (k)  for  k  =  nxTyl~T .  How¬ 
ever,  the  median  weight  is  approximately  equal  to  yl  by  Central  Limit  Theorem.  If  we  focus  on  the  higher 
half  degrees,  we  can  thus  let  r  >  yl.  In  this  case,  since  ( ya  +  (1  —  p)/3)M  (y/3  +  (1  —  y) rf)1~11  >  i, 


k  > 


(ya  +  (1  -  y)pY  (y/3  +  (1  -  y/j)1  ^ 


€  £1(1). 


If  we  differentiate  In  g:j  (k)  over  j, 

(ln « {k))' 50  -jyrxy) +  {k ln  ■ R  -  ~ kW “T  ln  ■ R  =  0 ■ 

Because  k  E  £1(1)  and  j ,  r  E  0(1),  we  can  conclude  that  R3~T  «  1  as  n  — >•  oo;  otherwise,  |  (ln  gj(k))' 
grows  as  large  as  £l(k).  Therefore,  when  j  «  r,  gj(k)  is  maximized. 
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Furthermore,  since  |  I  ^  ^ Ini?  as  n  — >  oo,  the  first  quadratic  term  in  ^n9j(k)  *s 

negligible.  Asaresult,  when  R  is  practical  (close  to  1.6  ~  3),  lngr+A  would  be  at  most  (©(— k\A\)  —  lngr) 
for  A  >  1.  After  all,  gT  effectively  dominates  the  probability  pk,  i. e. ,  In pk  is  roughly  proportional  to  In  gT. 
By  assigning  r  =  hik^^%y  ,  we  obtain 


In  pk  ~  C  - 
=  C'~ 


1 


2lp{\  —  p) 
1 


2lp(l  —  yr)(ln  R)2 


In  k  —  In  nyl  \  “  1 

— tafi - ~2lnk 


Ink  —  In  nyl  —  lp\nR  —  —  lp(l  —  /r)(ln  R)2 


—  In  k . 


for  some  constant  C' .  Therefore,  the  degree  distribution  pk  approximately  follows  the  log-normal  as  de¬ 
scribed  in  Theoreml6.ll  ■ 


E  Appendix:  Power-law  Distribution 


Proof  of  Lemma  [7721  Since  a,’s  are  independently  distributed  Bernoulli  random  variables,  Lemma  17721 
holds.  ■ 

Proof  of  Lemma  l73t  Let’s  define  Pj{u ,  v)  as  the  edge  probability  between  u  and  v  when  considering  only 
up  to  the  y -tli  attribute,  L  e. , 

3 

Pj(u,v )  =  PjBi  [ai(u),ai(v)]  . 

i—1 

Thus,  what  we  aim  to  show  is  that  for  a  node  v, 

i 

E  [. Pi(u,v )]  =  {piOti  +  (1  -  Pi)Pi)1{ai(-u)=0}  {pifii  +  (1  -  fij)7i)1{ai(“)=1}  • 

i—  1 

When  Z  =  1,  it  is  trivially  true  by  Lemma  [3721  When  l  >  1,  suppose  that  the  above  formula  holds  for 

l  =  1,2,- ••  ,  k.  Since  Pk+l(u,v)  =  Pk(u,  v)Qk+i  [ak+i(u),  ak+i{v)], 

E  [. Pk+l(u,v)\ 

=  E  [. Pk(u ,  n)]  E  [0fc+i[afc+i(n),  afc+i(n)]] 

=  E  [ Pk(u ,  n)]  (pk+iak+i  +  (1  —  pk+ilPk+i)1^^1^-^ (Hk+iPk+i  +  (1  - 
k-\- 1 

=  (/• H&i  +  (1  -  Pi)/3i)1{a,lyU)=()}  ( Pifii  +  (1  -  Mi)7i)1{ai(“)=1}  • 

i—  1 

Therefore,  the  expected  degree  formula  described  in  Lemma  1731  holds  for  every  l  >  1.  ■ 

Proof  of  Theorem  17.11  Before  the  main  argument,  we  need  to  define  the  ordered  probability  mass  of 
attribute  vectors  as  p^p  for  j  =  1,2,  •••  ,2l.  For  example,  if  the  probability  of  each  attribute  vector 
(00,  01, 10, 11)  is  respectively  0.2, 0.3,  0.4,  and  0.1  when  l  =  2,  the  ordered  probability  mass  is  p^\  =  0.1, 
P{ 2)  =  0.2,  P( 3)  =  0.3,  andp(4)  =  0.4. 
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Then,  by  Theorem  ID.  1[  we  can  express  the  probability  of  degree  k,  pk,  as  follows: 


Pk  = 


n  —  1 


3= 1 


C(1  -  E, 


\n—l—k 


(2) 


where  Ej  denotes  the  average  edge  probability  of  the  node  which  has  the  attribute  vector  corresponding  to 
P(jy  If  P(j)S  and  Ef  s  are  configured  so  that  few  terms  dominate  the  probability,  we  may  approximate  pk 
as  (n^1)p(T)(Sr)fc(  1  —  Er)n^1~k  for  r  =  argma ^-j P(j)  ( Ej)k  (1  —  Ej)n~l~k .  Assuming  that  this  approx¬ 
imation  holds,  we  will  propose  a  sufficient  condition  for  the  power-law  degree  distribution  and  suggest  an 
example  for  this  condition. 

To  simplify  computations,  we  propose  a  condtion  that  oc  EJS  for  a  constant  5.  Then,  the  j-th  term 
is 

)pW  (Erf  (1  -  E;)”-‘-‘  oc  (1  -  E,)"-'-6)  , 

which  is  maximized  when  Ej  ~  Moreover,  under  this  condition,  if  Ej+i/Ej  is  at  least  (1  +  z)  for 

a  constant  z  >  0,  then 

P( r+A)  (^+A)fc(l-gr+A)n~1~fc 
P(r)  (Erfil-Err-1-* 

is  o(l)  for  A  >  1  as  n  — >•  oo.  Therefore,  the  r-th  term  dominates  the  Equation  d2]). 

Next,  by  the  Stirling  approximation  with  above  conditions, 


fn  —  1 

V  k 


Pk 


n  —  1 
k 


k  —  5 
n  —  1  —  6 


k—S 


n  —  1  —  k 
n  —  1  —  5 


n—l—k 


l 


(k-sy 


cx 


\Jk{n  —  1  —  k) 

fc-6-i/2  exp(— 5) 


-5  ( (n  -  1  ){k  -  5) 
k(n  —  1  —  5) 


n  —  1 
n  —  1  —  5 


n—l—k 


for  sufficiently  large  k  and  n.  Thus,  /y.  is  approximately  proportional  to  k~ z~s  for  large  k  as  n  — >•  oc. 

Last,  we  prove  that  the  two  conditions  for  the  power-law  degree  distribution  are  simultaneously  feasible 
by  providing  an  example  configuration. 

If  every  p^  is  distinct  and  Ah—  =  ?  then  we  satisfy  the  condition  that  p^  oc  ( Ej)~s 

by  Lemma  17721  and  Lemma  1731  On  the  other  hand,  if  we  set  =  (1  +  z)~2  6 ,  then  the  other  condition, 
Ej+i  /Ej  >  (1+^)  is  also  satisfied.  Since  we  are  free  to  configure  //,’s  and  0,  ’s  independently,  the  sufficient 
condition  for  the  power  law  degree  distribution  is  feasible.  ■ 
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